SourceSeer: Forecasting Rare Disease Outbreaks Using Multiple Data Sources
Authors
Abstract
Rapidly increasing volumes of news feeds from diverse data sources, such as online newspapers, Twitter, and online blogs, are proving to be extremely valuable resources in helping anticipate, detect, and forecast outbreaks of rare diseases. This paper presents SourceSeer, a novel algorithmic framework that combines spatio-temporal topic models with source-based anomaly detection techniques to effectively forecast the emergence and progression of infectious rare diseases. SourceSeer is capable of discovering the location focus of each source, allowing sources to be used as experts with varying degrees of authoritativeness. To fuse the individual source predictions into a final outbreak prediction, we employ a multiplicative weights algorithm that takes into account the accuracy of each source. We evaluate the performance of SourceSeer using incidence data for hantavirus syndromes in multiple countries of Latin America, provided by HealthMap, over a timespan of fifteen months. We demonstrate that SourceSeer makes predictions of increased accuracy compared to several baselines and is capable of forecasting disease outbreaks in a timely manner even when no outbreaks were previously reported.
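The fusion step described in the abstract can be illustrated with a generic weighted-majority scheme, one common member of the multiplicative weights family. The sketch below is not SourceSeer's actual update rule, which the abstract does not specify. The source names, the binary outbreak/no-outbreak predictions, the penalty rate `eta`, and the helper functions `fuse_predictions` and `update_weights` are all assumptions made for illustration.

```python
from typing import Dict


def fuse_predictions(weights: Dict[str, float], predictions: Dict[str, bool]) -> bool:
    """Weighted-majority vote over sources; True means 'outbreak predicted'."""
    vote_for = sum(w for s, w in weights.items() if predictions[s])
    vote_against = sum(w for s, w in weights.items() if not predictions[s])
    return vote_for >= vote_against


def update_weights(
    weights: Dict[str, float],
    predictions: Dict[str, bool],
    outcome: bool,
    eta: float = 0.5,  # hypothetical penalty rate, not taken from the paper
) -> Dict[str, float]:
    """Multiply the weight of every source that was wrong by (1 - eta)."""
    return {
        s: (w * (1.0 - eta) if predictions[s] != outcome else w)
        for s, w in weights.items()
    }


# Hypothetical sources, all starting with equal authority.
weights = {"newspaper_a": 1.0, "blog_b": 1.0, "feed_c": 1.0}

# Each round: (per-source predictions, outbreak actually observed?)
history = [
    ({"newspaper_a": True, "blog_b": False, "feed_c": True}, True),
    ({"newspaper_a": False, "blog_b": True, "feed_c": False}, False),
]

fused_history = []
for predictions, outcome in history:
    fused_history.append(fuse_predictions(weights, predictions))
    weights = update_weights(weights, predictions, outcome)
```

In this toy run, `blog_b` is wrong in both rounds, so its weight is halved twice while the other two sources keep their initial weight. Sources that err repeatedly therefore lose influence over the fused prediction, which is the sense in which accuracy determines authoritativeness.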
Similar resources
Forecasting Rare Disease Outbreaks with Spatio-temporal Topic Models
Rapidly increasing volumes of news, tweets, and blogs are proving to be extremely valuable resources in helping anticipate, detect, and forecast significant societal events. In this paper, we focus on the problem of forecasting rare disease outbreaks and demonstrate how spatio-temporal topic models over health-related newspaper articles can successfully be used to forecast outbreaks. More preci...
Ensemble Forecasting for Disease Outbreak Detection
We describe a method to improve detection of disease outbreaks in pre-diagnostic time series data. The method uses multiple forecasters and learns the linear combination to minimize the expected squared error of the next day's forecast. This combination adaptively changes over time. This adaptive ensemble combination is used to generate a disease alert score for each day, using a separate multi...
A systematic review of studies on forecasting the dynamics of influenza outbreaks
Forecasting the dynamics of influenza outbreaks could be useful for decision-making regarding the allocation of public health resources. Reliable forecasts could also aid in the selection and implementation of interventions to reduce morbidity and mortality due to influenza illness. This paper reviews methods for influenza forecasting proposed during previous influenza outbreaks and those evalu...
Surveillance for foodborne disease outbreaks in Iran, 2006-2011
Background: Outbreaks of foodborne diseases are a major health problem and occur daily in all countries, from the most to the least developed. This study is the first report of foodborne outbreaks in Iran, covering 2006 to 2011. Methods: A retrospective, longitudinal study was carried out using foodborne disease national surveillance system data from 2006-2011, which have been re...
Remote sensing observation of annual dust cycles and possible causality of Kawasaki disease outbreaks in Japan
Kawasaki disease (KD) is a rare vascular disease that, if left untreated, can result in irreparable cardiac damage in children. While the symptoms of KD are well-known, as are best practices for treatment, the etiology of the disease and the factors contributing to KD outbreaks remain puzzling to both medical practitioners and scientists alike. Recently, a fungus known as Candida, originating i...
Publication date: 2015